Search CORE

289 research outputs found

Quality Evaluation of Machine Learning-based Point Cloud Coding Solutions

Author: Pereira Manuela
Pinheiro Antonio M. G.
Prazeres João
Rodrigues Rafael
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2022
Field of study

In this paper, a quality evaluation of three point cloud coding solutions based on machine learning technology is presented, notably, ADLPCC, PCC_GEO_CNN, and PCGC, as well as LUT_SR, which uses multi-resolution Look-Up Tables. Moreover, the MPEG G-PCC was used as an anchor. A set of six point clouds, representing both landscapes and objects were coded using the five encoders at different bit rates, and a subjective test, where the distorted and reference point clouds were rotated in a video sequence side by side, is carried out to assess their performance. Furthermore, the performance of point cloud objective quality metrics that usually provide a good representation of the coded content is analyzed against the subjective evaluation results. The obtained results suggest that some of these metrics fail to provide a good representation of the perceived quality, and thus are not suitable to evaluate some distortions created by machine learning-based solutions. A comparison between the analyzed metrics and the type of represented scene or codec is also presented.This research was funded by the Portuguese FCT-Fundação para a Ciência e Tecnologia under the project UIDB/50008/2020, PLive X-0017-LX-20, and by operation Centro-01-0145-FEDER-000019 - C4 - Centro de Competencias em Cloud Computing.info:eu-repo/semantics/acceptedVersio

UBibliorum repositorio digital da ubi

On the stability of point cloud machine learning based coding

Author: Pereira Manuela
Pinheiro Antonio M. G.
Prazeres João
Rodrigues Rafael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/10/2022
Field of study

This paper analyses the performance of two of the most well known deep learning-based point cloud coding solutions, considering the training conditions. Several works have recently been published on point cloud machine learning-based coding, following the recent tendency on image coding. These codecs are typically seen as a set of predefined trained machines. However, the performance of such models is usually very dependent of their training, and little work has been considered on the stability of the codecs’ performance, as well as the possible influence of the loss function parameters, and the increasing number of training epochs. The evaluation experiments are supported in a generic test set with point clouds representing objects and also more complex scenes, using the point to point metric (PSNR D1), as several studies revealed the good quality representation of this geometry-only point cloud metric.Research funded by the Portuguese FCT-Fundação para a Ciência e Tecnologia under the project UIDB/50008/2020, PLive X-0017-LX-20, and by operation Centro-01-0145-FEDER-000019 - C4 - Centro de Competencias em Cloud Computing.info:eu-repo/semantics/acceptedVersio

UBibliorum repositorio digital da ubi

Subjective Quality Evaluation of Point Clouds Using a Head Mounted Display

Author: Pereira Manuela
Pinheiro Antonio M. G.
Prazeres Joao
Rodrigues Rafael
Publication venue
Publication date: 29/10/2023
Field of study

This paper reports on a subjective quality evaluation of static point clouds encoded with the MPEG codecs V-PCC and G-PCC, the deep learning-based codec RS-DLPCC, and the popular Draco codec. 18 subjects visualized 3D representations of distorted point clouds using a Head Mounted Display, which allowed for a direct comparison with their reference. The Mean Opinion Scores (MOS) obtained in this subjective evaluation were compared with the MOS from two previous studies, where the same content was visualized either on a 2D display or a 3D stereoscopic display, through the Pearson Correlation, Spearman Rank Order Correlation, Root Mean Square Error, and the Outlier Ratio. The results indicate that the three studies are highly correlated with one another. Moreover, a statistical analysis between all evaluations showed no significant differences between them

arXiv.org e-Print Archive

Motion estimation with chessboard pattern prediction strategy

Author: Amirpour Hadi
Ghanbari Mohammad
Pereira Manuela
Pinheiro Antonio M. G.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Due to high correlations among the adjacent blocks, several algorithms utilize movement information of spatially and temporally correlated neighboring blocks to adapt their search patterns to that information. In this paper, this information is used to define a dynamic search pattern. Each frame is divided into two sets, black and white blocks, like a chessboard pattern and a different search pattern, is defined for each set. The advantage of this definition is that the number of spatially neighboring blocks is increased for each current block and it leads to a better prediction for each block. Simulation results show that the proposed algorithm is closer to the Full-Search algorithm in terms of quality metrics such as PSNR than the other state-of-the-art algorithms while at the same time the average number of search points is less.info:eu-repo/semantics/publishedVersio

UBibliorum repositorio digital da ubi

A Two-Step Segmentation Method for Breast Ultrasound Masses Based on Multi-resolution Analysis

Author: Braz Rui
Moutinho José
Pereira Manuela
Pinheiro Antonio M. G.
Rodrigues Rafael
Publication venue: 'Elsevier BV'
Publication date: 28/02/2015
Field of study

Breast ultrasound images have several attractive properties that make them an interesting tool in breast cancer detection. However, their intrinsic high noise rate and low contrast turn mass detection and segmentation into a challenging task. In this article, a fully automated two-stage breast mass segmentation approach is proposed. In the initial stage, ultrasound images are segmented using support vector machine or discriminant analysis pixel classification with a multiresolution pixel descriptor. The features are extracted using non-linear diffusion, bandpass filtering and scale-variant mean curvature measures. A set of heuristic rules complement the initial segmentation stage, selecting the region of interest in a fully automated manner. In the second segmentation stage, refined segmentation of the area retrieved in the first stage is attempted, using two different techniques. The AdaBoost algorithm uses a descriptor based on scale-variant curvature measures and non-linear diffusion of the original image at lower scales, to improve the spatial accuracy of the ROI. Active contours use the segmentation results from the first stage as initial contours. Results for both proposed segmentation paths were promising, with normalized Dice similarity coefficients of 0.824 for AdaBoost and 0.813 for active contours. Recall rates were 79.6% for AdaBoost and 77.8% for active contours, whereas the precision rate was 89.3% for both methods.info:eu-repo/semantics/publishedVersio

UBibliorum repositorio digital da ubi

MPEG DASH - some QoE-based insights into the tradeoff between audio and video for live music concert streaming under congested network conditions

Author: Melvin Hugh
Pereira Manuela
Pinheiro Antonio M. G.
Pocta Peter
Rodrigues Rafael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/06/2016
Field of study

The rapid adoption of MPEG-DASH is testament to its core design principles that enable the client to make the informed decision relating to media encoding representations, based on network conditions, device type and preferences. Typically, the focus has mostly been on the different video quality representations rather than audio. However, for device types with small screens, the relative bandwidth budget difference allocated to the two streams may not be that large. This is especially the case if high quality audio is used, and in this scenario, we argue that increased focus should be given to the bit rate representations for audio. Arising from this, we have designed and implemented a subjective experiment to evaluate and analyses the possible effect of using different audio quality levels. In particular, we investigate the possibility of providing reduced audio quality so as to free up bandwidth for video under certain conditions. Thus, the experiment was implemented for live music concert scenarios transmitted over mobile networks, and we suggest that the results will be of significant interest to DASH content creators when considering bandwidth tradeoff between audio and video.info:eu-repo/semantics/publishedVersio

UBibliorum repositorio digital da ubi

Severity classification in cases of Collagen VI-related myopathy with Convolutional Neural Networks and handcrafted texture features

Author: Carlier Robert-Yves
Pinheiro Antonio M. G.
Quijano-Roy Susana
Rodrigues Rafael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 04/07/2022
Field of study

(C) 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Magnetic Resonance Imaging (MRI) is a non-invasive tool for the clinical assessment of low-prevalence neuromuscular disorders. Automated diagnosis methods might reduce the need for biopsies and provide valuable information on disease follow-up. In this paper, three methods are proposed to classify target muscles in Collagen VI-related myopathy cases, based on their degree of involvement, notably a Convolutional Neural Network, a Fully Connected Network to classify texture features, and a hybrid method combining the two feature sets. The proposed methods were evaluated on axial T1-weighted Turbo Spin-Echo MRI from 26 subjects, including Ullrich Congenital Muscular Dystrophy and Bethlem Myopathy patients at different evolution stages. The hybrid model achieved the best cross-validation results, with a global accuracy of 93.8%, and F-scores of 0.99, 0.82, and 0.95, for healthy, mild and moderate/severe cases, respectively.info:eu-repo/semantics/acceptedVersio

arXiv.org e-Print Archive

UBibliorum repositorio digital da ubi

Quality comparison of the HEVC and VP9 encoders performance

Author: Bernardo Marco
Fernandes Pedro
Fiadeiro Paulo
Pereira Manuela
Pinheiro Antonio M. G.
Publication venue
Publication date: 01/01/2017
Field of study

This paper reports a comparison between two recent video codecs, namely the HEVC and the VP9, using High Definition Video Sequences encoded with different bit rates. A subjective test for the evaluation of the provided Quality of Experience is reported. The video sequences were shown to a panel of subjects on a High Definition LED display and the subjective tests were performed using a Single Stimulus Methodology. The results shown that the HEVC encoder provides a better visual quality on low bit rates than the VP9. Similar performance was obtained for visually lossless conditions, although the HEVC requires lower bit rates to reach that level. Moreover, the correlation of the subjective evaluation and three tested objective metrics (PSNR, SSIM, and FSIM) revealed a good representation of the subjective results, particularly the SSIM and the FSIM metrics.info:eu-repo/semantics/publishedVersio

UBibliorum repositorio digital da ubi

Severity classification in cases of Collagen VI-related myopathy with Convolutional Neural Networks and handcrafted texture features

Author: Carlier Robert-Yves
Pinheiro Antonio M. G.
Quijano-Roy Susana
Rodrigues Rafael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/10/2022
Field of study

© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Magnetic Resonance Imaging (MRI) is a non-invasive tool for the clinical assessment of low-prevalence neuromuscular disorders. Automated diagnosis methods might reduce the need for biopsies and provide valuable information on disease follow-up. In this paper, three methods are proposed to classify target muscles in Collagen VI-related myopathy cases, based on their degree of involvement, notably a Convolutional Neural Network, a Fully Connected Network to classify texture features, and a hybrid method combining the two feature sets. The proposed methods were evaluated on axial T1-weighted Turbo Spin-Echo MRI from 26 subjects, including Ullrich Congenital Muscular Dystrophy and Bethlem Myopathy patients at different evolution stages. The hybrid model achieved the best cross-validation results, with a global accuracy of 93.8%, and F-scores of 0.99, 0.82, and 0.95, for healthy, mild and moderate/severe cases, respectively.info:eu-repo/semantics/acceptedVersio

UBibliorum repositorio digital da ubi

Assessment of speckle denoising filters for digital holography using subjective and objective evaluation models

Author: Bernardo Marco V.
Fiadeiro Paulo
Fonseca Elsa
Pereira Manuela
Pinheiro Antonio M. G.
Publication venue: 'The Optical Society'
Publication date: 01/01/2019
Field of study

Digital holography is an emerging imaging technique for displaying and sensing three dimensional objects. The perceived image quality of a hologram is frequently corrupted by speckle noise due to coherent illumination. Although several speckle noise reduction methods have been developed so far, there are scarce quality assessment studies to address their performance and they typically focus solely on objective metrics. However, these metrics do not reflect the visual quality perceived by a human observer. In this work, the performance of four speckle reduction algorithms, namely the nonlocal means, the Lee, the Frost and the block matching 3D filters, with varying parameterizations, were subjectively evaluated. The results were ranked with respect to the perceived image quality to obtain the mean opinion scores using pairwise comparison. The correlation between the subjective results and twenty different no-reference objective quality metrics was evaluated. The experiment indicates that block matching 3D and Lee are the preferred filters, depending on hologram characteristics. The best performing objective metrics were identified for each filter.info:eu-repo/semantics/publishedVersio

UBibliorum repositorio digital da ubi